Discriminative Learning of Probabilistic Sequence Models for Sequence Labeling Problems

نویسندگان

  • Minyoung Kim
  • Yushi Jing
  • Vladimir Pavlovic
  • James M. Rehg
چکیده

The problem of labeling (or segmenting) sequences is very important in many applications such as part-of-speech tagging in natural language processing, multimodal object detection in computer vision, and DNA/protein structure prediction in bioinformatics. Conditional Random Fields (CRFs) of [1] are known to be the best sequence models ever for the problem. CRF is a conditional model, P (s|y), infered from a (joint) log-linear model P (s,y), which is derived from Maximum Entropy (ME) principle. Since the dual formulation of ME results in Maximum Likelihood (ML) objective, it is natural to learn CRF via Conditional Likelihood Maximization (CML). Recently, diverse numerical optimization methods for CML (e.g., IIS, Conjugate Gradient, and Quasi-Newton) together with their convergence rates as well as prediction performances have been studied (e.g., [2]).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning Discriminative Relational Features for Sequence Labeling

Discovering relational structure between input features in sequence labeling models has shown to improve their accuracy in several problem settings. However, the search space of relational features is exponential in the number of basic input features. Consequently, approaches that learn relational features, tend to follow a greedy search strategy. In this paper, we study the possibility of opti...

متن کامل

Comparative Gene Prediction using Conditional Random Fields

Computational gene prediction using generative models has reached a plateau, with several groups converging to a generalized hidden Markov model (GHMM) incorporating phylogenetic models of nucleotide sequence evolution. Further improvements in gene calling accuracy are likely to come through new methods that incorporate additional data, both comparative and species specific. Conditional Random ...

متن کامل

RNA sequence analysis using covariance models.

We describe a general approach to several RNA sequence analysis problems using probabilistic models that flexibly describe the secondary structure and primary sequence consensus of an RNA sequence family. We call these models 'covariance models'. A covariance model of tRNA sequences is an extremely sensitive and discriminative tool for searching for additional tRNAs and tRNA-related sequences i...

متن کامل

An Empirical Evaluation of Sequence-Tagging Trainers

The task of assigning label sequences to a set of observed sequences is common in computational linguistics. Several models for sequence labeling have been proposed over the last few years. Here, we focus on discriminative models for sequence labeling. Many batch and online (updating model parameters after visiting each example) learning algorithms have been proposed in the literature. On large...

متن کامل

­­Image Segmentation using Gaussian Mixture Model

Abstract: Stochastic models such as mixture models, graphical models, Markov random fields and hidden Markov models have key role in probabilistic data analysis. In this paper, we used Gaussian mixture model to the pixels of an image. The parameters of the model were estimated by EM-algorithm.   In addition pixel labeling corresponded to each pixel of true image was made by Bayes rule. In fact,...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006